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ABSTRACT 

A multifunctional reagent based on a coumarin scaf- 
fold was developed for derivatization of naive RNA. 
The alkylating agent N3BC [7-azido-4-(bromo- 
methyl)coumarin], obtained by Pechmann conden- 
sation, is selective for uridine. N3BC and its RNA 
conjugates are pre-fluorophores which permits con- 
trolled modular and stepwise RNA derivatization. 
The success of RNA alkylation by N3BC can be 
monitored by photolysis of the azido moiety, which 
generates a coumarin fluorophore that can be ex- 
cited with UV light of 320 nm. The azidocoumarin- 
modified RNA can be flexibly employed in 
structure-function studies. Versatile applications 
include direct use in photo-crosslinking studies to 
cognate proteins, as demonstrated with tRNA and 
RNA fragments from the MS2 phage and the HIV 
genome. Alternatively, the azide function can be 
used for further derivatization by click-chemistry. 
This allows e.g. the introduction of an additional 
fluorophore for excitation with visible light. 

INTRODUCTION 

Functionalization of RNA is of paramount importance in 
all RNA related research in the life sciences. A plethora of 
labels and functional groups in RNA are in use e.g. for 
identification, sequencing and imaging of RNA, as well as 
for functional and structural studies of RNA and RNA 
protein complexes. Co-synthetic approaches to RNA 
functionalization make use of modified building blocks, 
carrying either the desired functionality or a functional 
group that allows selective derivatization in a further step. 



This concept is widely developed in a huge number of 
modified phosphoramidites for RNA solid phase synthesis 
(1-3). For example, the incorporation of fluorescent labels 
as built-in probes may be achieved via phosphoramidites 
of fluorescent nucleotide analogs, via building blocks of 
conjugates of fluorescent dyes and canonical nucleotides 
or via phosphoramidites carrying a reactive moiety as 
an attachment site, e.g. a primary aminofunction for 
later derivatization with NHS-derivatives of fluorophores 
(1,3,4). Co-synthetic functionalization has also been de- 
veloped for RNA synthesis via run-off in vitro transcrip- 
tion, e.g. for nucleotide analog interference mapping 
(NAIM) studies (5). However, the range of nucleotide 
analog triphosphates (XTPs) successfully incorporated 
into RNA is severely restricted, due to limitations on 
size and base pairing properties imposed by the RNA 
polymerase. Successfully incorporated XTPs include a 
number of naturally occurring modified nucleotides with 
unaltered Watson-Crick faces (6), numerous nucleobase 
analogs carrying additional functionalities attached to the 
pyrimidine C5 position (7-9), as well as oc-phosphorothio- 
ates (5) and NTPs modified at the 2'-OH (10,11). 
Incorporation of XTP building blocks may be quantita- 
tive, instead of the respective natural NTP (12), or in a 
mixture of natural NTP and its analog, where the ratio 
determines the extent of incorporation. In the latter case, 
incorporated modifications are randomly distributed over 
the entire transcript length (5,9). Site-specific incorpor- 
ation, a typical feature of phosphoramidite chemistry, is 
much more difficult to achieve for RNA transcripts, 
because it requires additional information to be processed 
by the polymerase. In an extension of the genetic code, 
RNA molecules with enhanced functionality, including 
fluorescent probes and attachment sites for crosslinking 
moieties, have been obtained by incorporation of several 
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XTPs, which form unnatural base pairs (13-17). Random 
incorporation of attachment sites is not restricted to 
in vitro synthesis of RN A, as is illustrated by the biological 
incorporation of 5-ethinyluridine in living cells. Such 
in vivo synthesized RNA was made visible via click conju- 
gation of a fluorescent dye (18). 

An area much less developed is the functionalization 
of naive RNA, i.e. RNA synthesized in the absence 
of modified building blocks, be it in vivo, by in vitro tran- 
scription or by phosphoramidite chemistry. Post-synthetic 
functionalization can be site-directed, e.g. by post- 
transcriptional modification enzymes. Indeed, many early 
studies on tRNA derivatization make use of pre-existing 
native modifications (19,20). We have recently used 
RNA:methyltransferases to introduce alkynylgroups into 
RNA as an attachment point for further derivatization via 
click chemistry (21). However, in certain applications, 
site-specificity is not a desirable attribute, e.g. when not 
all sequence information is available as is the case when 
mixtures of various RNA species are to be functionalized. 
Another application with a particular demand for versa- 
tile, random post-synthetic introduction of functional 
groups into RNA is the identification of RNA protein 
contacts by UV-irradiation (22). A popular modification 
for crosslinking studies are thiouridines, which occur as a 
native modification in tRNA, and can be co-synthetically 
incorporated into RNA via solid phase synthesis or into 
transcripts as an XTP. Thiouridines produce UV- 
dependent crosslinks by themselves (22-24) or can be 
used as an attachment point for aromatic azides (23). 
This co-synthetic introduction of modifications as a pre- 
requisite exempts a variety of RNAs from use in cross- 
linking studies, a drawback that we seek to overcome. 

Azides, which are relatively inert under physiological 
conditions, have lately become very popular functional 
groups in bioconjugate chemistry. This recent develop- 
ment is mostly due to their efficient orthogonal 
functionalization by the copper(I)-catalyzed azide-alkyne 
Huisgen cycloaddition (CuAAC) reaction (25-27). This 
archetypical 'click' reaction (28) is known to proceed 
between an azide and an alkyne-containing moiety with 
high selectivity i.e. even in the presence of various func- 
tional groups commonly encountered inside a living cell 
and is therefore also known as 'bio-orthogonal' (29). 
Moreover, the reaction yield is generally very good even 
at low concentrations of both partners and mild reaction 
conditions can be applied that are non-destructive with 
respect to proteins and DNA [e.g. (27)]. Applications to 
RNA have only very recently become popular (18,21,30- 
32). On the other hand, aromatic azides are also known as 
photoaffinity labels in crosslinking experiments (23,33- 
36). Alkylating agents commonly applied to RNA so far 
are typically monofunctional, and selective post-synthetic 
RNA derivatization is hampered by the fact that few 
reagents show an exploitable degree of selectivity for a 
singular nucleophilic moiety in the nucleobases (37,38). 

Here, we present N3BC [7-azido-4-(bromomethyl)-2H- 
chromen-2-one] as a versatile multifunctional reagent, 
which allows the controlled post-synthetic functional- 
ization of RNA. N3BC selectively alkylates uridine 
residues, thereby introducing an azido-chromophore that 



allows spectroscopic verification of the reaction efficiency, 
photochemical crosslinking and versatile post-synthetic 
derivatization by click chemistry. 

MATERIALS AND METHODS 

Reaction of N3BC with nucleosides and RNA 

The details of the synthesis of N3BC is described in the 
Supplementary Data. Nucleosides adenosine, guanosine, 
uridine and cytidine and RNA homopolymers poly-rA, 
poly-rC, poly-rG and poly-rU were from (Sigma- 
Aldrich, Munich, Germany) and pseudouridine was 
from (Berry & Associates, Dexter, MI, USA). RNA 
binary and ternary oligonucleotides were from Biomers, 
Ulm, Germany [G/U (3'-GGUUGUGGUGGUUUGUU 
GGU), G/C (3'-CCGGCCCGGCGGCCGGCGCG), 
U/C (3'-UUCUUUCUUUCCCUUCCUUU), U/A (3'-U 
UAUAUUUAUAAAUAUUAAA)], Iba, Gottingen, 
Germany [CUG: (3'-CUUCGUUCGCUGGUC, GAC: 
(3'-GACCAGCGAACGAAGCAGG)] and Sigma- 
Aldrich [G/A (3' - A AAGG A AGGG AAAG AAAG AA) , 
C/A (3'-ACCAACAAACCACCACAACC), G/C (3'-CC 
GGCCCGGCGGCCGGCGCG)]. In vitro transcription 
was conducted as previously described for S. cerevisiae 
tRNA phe (21), and MS2 and HIV-derived RNAs (39), 
respectively. 

N3BC conjugation to nucleosides and HPLC analysis 

Each of the four major ribonucleosides and pseudouridine 
were separately reacted in the dark at a final concentration 
of 0.4 mM with N3BC (8 mM) for 1 h at 37°C in a solution 
containing 50% DMSO and 100 mM sodium phosphate 
buffer at appropriate pH (e.g. pH 9.0 for reactions 
analyzed in Figure 2). Reaction mixtures were analyzed 
on an Agilent HP 1100 series equipped with DAD and 
FLD (Excitation 350 nm, emission 470nm). A Synergy 
Fusion RP column (4um particle size, 80 A pore size, 
250 mm length, 2 mm inner diameter) from Phenomenex 
(Aschaffenburg, Germany) with a guard column was used 
at 35°C. The solvents consisted of 5mM ammonium 
acetate buffer adjusted to pH 5.3 using acetic acid 
(solvent A) and pure methanol (solvent B). The elution 
started with isocratic flow of 100% solvent A for 3min, 
followed by a linear gradient to 30% solvent B at lOmin, 
then to 70% solvent B at 20min and further to 80% 
solvent B at 25min. Initial conditions were regenerated 
by rinsing with 100% solvent A for lOmin. The flow 
rate was 0.3 ml/min. 

N3BC derivatization and workup of RNA 

RNA (10-20 (iM final concentration) was incubated with 
lOmM N3BC (200 mM stock solution in pure DMSO), 
100 mM phosphate buffer pH 8.5 and 70% DMSO at 
37°C for 180 min under light protection. After the reaction, 
three volumes of 0.5 M NH 4 OAc and eight volumes 
of ethanol were added, the mixture was incubated at 
— 20° C (2h) and RNA was precipitated by centrifu- 
gation at 15000g at +4°C. The RNA was dissolved in 
50 ul of pure water, subjected to gel-filtration on an 
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Illustra G-25 column (GE Healthcare, Munich, Germany) 
and re-precipitated as above. 

PAGE analysis of N3BC treated RNA 

The concentration of N3BC-conjugated RNA was deter- 
mined using a Nanodrop-ND-1000 (Peqlab, Erlangen, 
Germany) and 1 00-300 pmol (exact amounts are specified 
in the text) were analyzed on a 15-20% urea gel after 
exposure to daylight for 1 h. Blue fluorescence of 
coumarin derivatives was observed upon radiation with 
UV light (365 nm) and imaged with a GelDoc transillumi- 
nator (Peqlab, Erlangen, Germany). Afterwards, the RNA 
was stained for 10 minutes either with GelRed (according 
to the instructions of Biotium, Hayward, CA, USA) or 
with StainsAll solution (0.65x TBE, 0.01% StainsAll, 
10% formamide, 25% isopropanol and 65% water), 
followed by destaining for 2 h in 1 x TBE buffer contain- 
ing 25% isopropanol. 

Thin layer chromatography of N3BC treated RNA 

N3BC treated RNA (100 pmol) was digested to nucleo- 
tides with 0.3 U of nuclease PI and spotted on a cellulose 
plate (Merck, Darmstadt, Germany ref#l. 05577. 0001) 
and run in NI solvent (isobutyric acid 330 ml, ammonium 
hydroxide 25% 5 ml, water to 500 ml) for the first dimen- 
sion and RII solvent (0.1 M Na 2 HP0 4 /NaH 2 P04 buffer, 
pH 6.8, 500 ml, ammonium sulfate 300 g, 1-propanol 
10 ml) for the second dimension (40). TLC plates were im- 
aged with an Alphalmager (Alphalnnotech, San Leandro, 
USA) at simultaneous impinging illumination at 254 and 
365 nm (Supplementary Figure S4). 

Conditions for click reaction with fluorescein-alkyne and 
AlexaFluor 647-alkyne 

AlexaFluor 647-alkyne was from Invitrogen (ref# A 10278) 
and fluorescein-alkyne was synthesized according to (41). 
The click reaction between N3BC-tRNA and 
alkyne-containing fluorescent dye was performed in final 
volume of 10 ul at RT in the dark in PUS buffer (100 mM 
Tris-HCl, pH 8.0, lOOmM NH 4 OAc, ImM MgCl 2 ). To 
catalyze the reaction between azide and alkyne moieties, 
Cu + was generated in situ by Na-ascorbate reduction of 
Cu 2+ . In our tests we used the THPTA ligand (2.5 mM, 
kindly provided by Dr E. Weinhold, Aachen, Germany), 
CuS0 4 (0.5 mM) and Na-ascorbate (5mM). Alkyne- 
coupled fluorescent dye was used at 50 uM final concen- 
tration. Reaction time was 2 h, unless indicated otherwise. 
Reaction products were analyzed by 10% urea-PAGE. 
Fluorescein-alkyne does not co-migrate with tRNA 
during electrophoresis, and thus its removal is not neces- 
sary, while AlexaFluor 647 co-migrates with the tRNA 
band and has to be removed by prior gel-filtration 
through Illustra G-25 columns. Staining with 
SYBRGold (Invitrogen) was performed according to the 
manufacturer's instructions. The gels were scanned for 
fluorescence on a Typhoon (GE Healthcare) using 
preexisting settings for fluorescein-alkyne (Green laser 
532 nm excitation, 526 SP filter, 650 V PM Voltage) and 
Alexa 647-alkyne (Red laser 634 nm excitation, 670 BP 30 
filter, 450 PM Voltage). 



N3BC conjugation of a- 2 P-GTP labeled tRNA Phe and 
RNase Tl digest 

In vitro transcription of tRNA phe was performed in 
the presence of [a- 32 P]-GTP for 3h at 37°C followed by 
urea-PAGE for transcript purification. Aliquots 
(1-3 pmol) of RNA were digested with 500 U of RNase 
Tl (Fermentas) in lx Tl buffer (12.5 mM Na-citrate, pH 
4.5, 0.5 mM EDTA and 3.5 M urea) for 20min at 42°C. 
N3BC treated tRNA phe (10 fig) was digested under the 
same conditions. The resulting fragments were then 
conjugated to AlexaFluor 647-alkyne (as described above). 
RNase Tl digests were separated on a 20% sequencing gel 
and signals from radioactive and fluorescent labels were 
correlated taking into account a relative retardation of 
fluorescent fragments due to the AlexaFluor647-label 
(Supplementary Figure S7). 

Fluorescence increase measurement of N3BC-polyU and 
N3BC-tRNA Phe upon UV irradiation 

A sample of N3BC-polyU or N3BC-tRNA phe (-0.5-1 ug) 
in PUS buffer was irradiated with an 8 W hand-held UV 
365 nm lamp in the spectrofluorimeter cuvette. 
Fluorescence spectra were recorded at the indicated time 
points using a JASCO FP-6500 spectrofluorimeter 
(Excitation 360 nm, 5nm bandwidth, emission scan 370- 
500 nm, 10 nm bandwidth). 

LC/MS method and analysis 

Digestion of RNA for LC/MS (liquid chromatography 
coupled to mass spectrometry) analysis. Naive or 
N3BC-treated RNA (final concentration 1 ng/|il) was 
dissolved in 20 mM NH 4 OAc, pH 5.3 and incubated for 
5h at 37°C in the presence of 3 U nuclease PI (Roche 
Diagnostics, Mannheim, Germany) per 100 fig RNA. 
Snake venom phosphodiesterase (Worthington, 
Lakewood, USA) was then added to a concentration of 
0.06 U per 100 ug RNA and the mixture was incubated at 
37°C for another 2h. Finally to convert the resulting 
mixture of mononucleotides to free nucleosides, 1/10 vol 
of lOx SAP buffer (Fermentas, St. Leon-Roth, Germany) 
was added, followed by 3/20 vol of H 2 0 and 1/4 vol of 
Shrimp Alkaline Phosphatase (SAP stock at 1 U/ul; from 
Fermentas, St. Leon-Roth, Germany) The mixture was 
incubated for 1 h at 37° C. 

LC/MS/MS instrumental analysis parameters. The LC/ 
MS/MS system (Thermo Electron, Dreieich, Germany) 
consisted of a Surveyor HPLC (quaternary Surveyor LC 
pump plus, Surveyor autosampler plus with integrated col- 
umn heater and cooled sample tray and double- 
wavelength Surveyor UV-Vis plus) and a triple stage 
quadrupole mass spectrometer (TSQ 7000 with API-2 
ion source and performance kit). Chromatography was 
performed under the conditions similar to those detailed 
above (section: N3BC conjugation to nucleosides and 
HPLC analysis) except that acetonitrile was used as 
an eluent. The gradient conditions for mass scan and se- 
lective reaction monitoring, are given in Supplementary 
Tables SI and S2, respectively. UV-detection was per- 
formed at 254 and 320 nm. Subsequently, the eluent was 
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introduced without splitting into the electrospray ion 
source (ESI) of the mass spectrometer. ESI interface par- 
ameters were as follows: middle position, spray voltage 
4.5 kV, sheath gas (N 2 ) 90psi, aux gas (N2) 10 scales, ca- 
pillary heater temperature 350°C. For identification of nu- 
cleosides, azides and derivatives, the mass spectrometer 
scanned the mass range from mjz 160 to m/z 600 (scan 
time 1 s) in the single MS mode. For selective and sensitive 
detection as well as quantification, selected reaction moni- 
toring measurements were performed at 1.5 kV multi- 
plier voltage in the MS/MS mode. MS/MS transitions 
monitored are given in Supplementary Table S3. 

UV-crosslinking of N3BC-modified RNA 

5'- 32 P-labeling of RNA. About 80pmol of N3BC- treated 
RNA or control unmodified transcript were first treated 
by 8 U of SAP in the appropriate buffer, the phosphatase 
was inactivated at 65°C for 15min and transcripts were 
ethanol precipitated. Labeling was performed with 
lOpmol of [y- 32 P]-ATP using T4 polynucleotide kinase 
in labeling buffer A (Fermentas, St. Leon-Roth, 
Germany). Unincorporated [y- 32 P]-ATP was removed by 
Illustra G-25 spin columns and 5'- P-labeled tRNA tran- 
scripts were purified by 10% urea-PAGE. 

Conditions for RNA-RNA crosslinks. All crosslinking ex- 
periments were performed in PUS buffer (lOOmM Tris- 
HC1, pH 8.0, 100 mM NH 4 OAc, ImM MgCl 2 ). For 
internal crosslinking, ~1 pmol of 5'- 32 P-labeled transcript 
was dissolved in PUS buffer. Crosslinking was performed 
using a hand-held 8W CAMAG UV Lamp with dual 
wavelength (254/365 nm) selection, (ref 022.9120, 
CAMAG, Switzerland) equipped with 254 nm (ref 
352.010) and 365 nm (ref 352.011) tubes. Samples for ir- 
radiation were placed into a V-shaped polypropylene 
96-well plate on ice and the UV-lamp equipped with a 
UV-transparent glass plate was placed on top at a 
distance between UV-tube and sample of ~10mm. After 
exposure for the indicated period of time, transcripts were 
analyzed by 10% urea-PAGE and 32 P-imaging was per- 
formed using a Typhoon phosphorimaging screen (GE 
Healthcare). 

Conditions for RNA-protein crosslinks. His-tagged recom- 
binant RNA modification enzymes were prepared by 
Immobilized Metal Ion Affinity Chromatography 
(IMAC) or GST-affinity chromatography using proced- 
ures described previously for scTrm4 (42), tmTruB (43), 
pfTrml (44), paTrml (45) and paL7ae (46). MS2-MBP 
protein was purified by affinity chromatography on 
amylose column followed by Heparine-Sepharose chroma- 
tography (47). Recombinant hnRNP Al protein was 
prepared as described previously (48). Preparation of 
HIV-1 derived RNAs SLS2 and SLS123 was described 
previously (49). Preparation of Pab21 RNA and S 22 
mer RNA was done as described in (50). RNA-protein 
crosslink experiments were performed with 1 pmol of 
5'- 32 P-labeled transcript and ~1 ug of different RNA 
binding and control proteins in PUS buffer. Irradiation 
with hand-held 8W UV lamp was performed at UV 
365 nm or at 254 nm for 30min. After incubation, 



RNA-protein crosslinked products were directly loaded 
onto 10% SDS gel. Protein staining was done by 'Rapid 
Stain' (G-Biosciences, St. Louis, USA) and gels were 
imaged in a wet state using a Typhoon screen. 

RESULTS 

The search for a multifunctional RNA derivatization 
agent was guided by the necessity to accommodate a 
reactive electrophilic moiety and a photoreactive azide in 
a small molecule with properties favorable for post- 
modification analytics. Coumarin appeared as a promising 
scaffold because it shows absorption ~320nm, thus in a 
UV range where nucleic acids do not absorb. Recent work 

(51) identified certain dark coumarin azides as pre- 
fluorophores, which display strong fluorescence after de- 
struction of the quenching azide moiety. These proper- 
ties conceivably allow identifying coumarin residues in 
RNA and thus offer a potential to monitor reaction 
progress. 

Synthesis of N3BC and reaction with nucleosides and 
RNA homopolymers 

Starting from the known reaction of 4-(bromomethyl)- 
7-methoxy-2H-chromen-2-one with modified uridines, 
including in particular thiouridines and pseudouridine 

(52) , we hypothesized that changes in the substitution 
pattern of the coumarin ring in 4-(bromomethyl)-couma- 
rins might modify the reactivity such that alkylation of 
otherwise unmodified RNA would become possible. 
Simultaneously, replacement of the electron donating 
7-methoxy group with the electron withdrawing azide 
would accomplish the accommodation of an azide group 
for potential later photo-crosslinking and click chemistry. 
We thus devised a five-step synthesis that yielded N3BC 
(1) from 3-aminophenol (2) with an overall yield of 44% 
(Figure 1). 

The light-sensitive compound 1 (termed N3BC) shows 
low fluorescence, possibly due to quenching by both the 
bromo- and the azidogroup (51). Its typical chromophore 
allows the detection of coumarin-containing reaction 
products by monitoring the absorption at 320 nm in 
RP18-chromatography runs (Figure 2A). The choice of 
initial pH and solvent conditions was guided by consider- 
ations of reactivity, selectivity and RNA stability. While 
RNA chains are increasingly prone to phosphodiester hy- 
drolysis as a result of general base catalysis, reactivity 
is expected to increase with the deprotonation status of 
nucleobases at higher pH. Initial experiments had 
indicated a pH range between 8 and 9, a solvent content 
of 50% (v/v) DMSO and a temperature of 37°C as a rea- 
sonable starting point from which to explore reactivity 
and selectivity. To detect nucleoside alkylation products, 
each of the four major ribonucleosides was incubated 
with N3BC and the reaction mixtures were analyzed 
by LC. As would be expected from a reactive compound, 
a mock incubation of N3BC without nucleosides showed 
decomposition into numerous coumarin products, as de- 
tected in UV-traces at 320 nm (Figure 2A, lower trace). 
In comparisons to the other UV-traces in Figure 2A, 
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Figure 1. Synthesis of 7-azido-4-(bromomethyl)-2H-chromen-2-one 1 (N3BC). Experimental details of the Pechmann condensation are described in 
the supplement. 



only uridine shows a covalent adduct containing both 
nucleoside and coumarin moiety. Pseudouridine, as the 
most abundant minor nucleoside, was also assayed 
and found equally reactive (52) (see Supplementary 
Figure SI). 

To the exclusion of light, the reaction product of uridine 
and N3BC, termed UN3C (Figure 2C), was isolated in 
microgram quantities from HPLC runs, which allowed 
its characterization by UV-absorption (Figure 2D), fluor- 
escence (Figure 2E and F) and LC-MS (see below). 
Intriguingly, LC-traces show very low fluorescence of 
UN3C, but solutions of the isolated compound become 
increasingly fluorescent upon irradiation at 365 nm 
(Figure 2E and F). This strongly suggests that the charac- 
teristic coumarin fluorescence is quenched by the 
azidofunction (51), and that fluorescence is restored by 
its photochemical destruction. Photochemical conversion 
to the fluorescent reduction product of a presumed nitrene 
intermediate (see Supplementary Figure S2) could be 
monitored in an HPLC/UV/FLD and LC/MS analysis 
of the irradiated UN3C (See Supplementary Figure S3). 

When the investigations were extended to the four RNA 
homopolymers, only N3BC-treated polyU showed a 
coumarin chromophore in absorption spectra. Upon 
UV-irradiation, N3BC-treated polyU, but not the other 
RNA homopolymers, showed increasing blue fluorescence 
as well, providing a convenient means to monitor the 
success of the reaction (See Supplementary Figure S3E). 
A PAGE analysis of N3BC treated homopolymers failed 
because of extremely heterogeneous size distribution. 
After digestion to nucleosides, only the UV-trace of the 
reaction mixture of N3BC with polyU (Figure 2B) showed 
the presence of a coumarin-containing product, which cor- 
responded to UN3C in its retention time, UV absorption 
and fluorescence properties. 

Reaction of N3BC with binary RNA oligomers 

Because RNA homopolymers do not always behave like 
typical RNA samples, the analysis was extended to further 
RNAs, starting with oligonucleotides of binary 
nucleobase composition. In a re-evaluation of reaction 



conditions, the DMSO content was increased to 70% in 
order to denature residual RNA secondary (2D) and 
tertiary (3D) structure and to promote reactivity. 
Reacted oligomers were precipitated from the reaction 
mixture and small molecules were removed by microscale 
gel filtration. 

To exploit fluorescence of the coumarin chromophore 
by photochemical destruction of the quenching azide 
moiety (Figure 2E and F), samples were exposed to 
daylight for at least 1 h prior to electrophoresis and gels 
were run without light protection. As seen in the PAGE 
analysis in Figure 3, reactive oligomers show coumarin 
fluorescence. Figure 3A shows assays of the A/C, A/G 
and A/U oligomers at pH values between pH 8 and 9. 
In keeping with the previous analysis, the A/C oligomer 
does not show any fluorescence, while the A/U oligomer 
gives rise to a strong signal, originating presumably from 
UN3C. In contrast to the above results obtained with 
RNA homopolymers, a weak but measurable signal is 
apparent for the A/G oligomer, presumably originating 
from a reaction of guanosines with N3BC. Notably, this 
signal is much weaker than that from the A/U oligomer, 
and it is only detectable when at least 200 pmol of the A/G 
oligomer are loaded onto the gel, a quantity correspond- 
ing to ~1.5nmol of guanosine residues per band. 

The results in Figure 3A thus suggest a strong reaction 
of N3BC with uridine residues in RNA and an additional 
weak side reaction with guanosines, while cytidines and 
adenosines remain unreactive. For confirmation, the re- 
maining binary oligomers were investigated. In this and 
the following reactions, the pH was set to 8.5 and the 
DMSO content was kept at 70%. Figure 3B shows a 
PAGE analysis after N3BC modification reactions of 
20-mers of all six possible binary permutations of nucleo- 
tides. In contrast to homopolymers, single bands of 
these bipartite oligomers appear in the loading control 
(Figure 3B, StainsAll stain) except for the species 
composed of cytidine and guanosine residues (termed G/ 
C-oligomer). The smeared signal observed for the latter 
species is reproducible for samples of different origin and 
therefore likely due to incomplete denaturation during 
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Wavelength [nm] Time [min] 

Figure 2. Reactivity of N3BC with RNA nucleosides and homopolynucleotides. Reactions of N3BC with free nucleosides and homopolynucleotides 
were performed as described in 'Materials and Methods' section. Reaction products with nucleosides were directly used for HPLC analysis (A), while 
homopolynucleotides (polyC, polyU etc.) were precipitated to remove unreacted N3BC, and were then digested to free nucleosides for HPLC analysis 
(B). Only uridine forms a reaction product with N3BC, termed UN3C, which is indicated by an arrow in (A) and (B). Note that N3BC itself is so 
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Figure 3. In-gel detection of RNA-coumarin conjugates of binary 
oligomers. (A) Polyacrylamide gel of binary oligomers A/C, A/G and 
A/U (200pmol each) after reaction with N3BC at pH 8.0, 8.5 and 9.0 
(indicated on the bottom of the panel). Samples were exposed to 
daylight for 1 h prior to PAGE. The loading control with StainsAII 
was imaged with a conventional digital camera, and fluorescence 
upon excitation at 365 nm was imaged with a GelDoc. Note that the 
loading control developed with StainsAII does not allow comparison 
among binary oligomers of different composition (compare 
Supplementary Figure S4). (B) Polyacrylamide gel of binary oligomers 
(300 pmol each) of all six permutations of nucleobases (the nucleobase 
composition is indicated above the gel image) after reaction with N3BC 
at pH 8.5 (right panels) or mock incubation (left panels). Imaging was 
done as described in (A) Note that the signal of the GC oligomer 
smears in the loading control as a consequence of incomplete 
denaturation. 



electrophoresis. For each sample, the amount of RNA was 
determined by UV absorption prior to gel loading. 
Interestingly, some binary RNA compositions led to dif- 
ferential staining using common RNA-staining dyes. A/C 
and C/U-oligomers could not be efficiently stained with 
ethidium bromide or GelRed (Supplementary Figure S4). 
Although StainsAII reveals oligomers of all binary com- 
positions, staining efficiency is disparate and does not 
allow quantitative comparison among binary oligomers 
of different composition. 

Aliquots of the above reactions were digested to 
mononucleotides and analyzed by two-dimensional TLC 
in solvent systems established to detect natural RNA 
modification. Coumarin fluorescence was observed 
for all binary oligomers containing uridines (corres- 
ponding to photolysis products of UN3C) but not for 
other binary oligomers (see Supplementary Figure S5). 
Thus, the in-gel detection is more sensitive, presum- 
ably because bands in the gel are more focused than 
TLC spots. 



LC/MS characterization of N3BC reaction products 

To verify these findings, LC/MS analysis was employed, 
piloted by characterization of UN3C isolated from the 
initial nucleoside reactions. In LC/MS runs, the UV- 
signal of UN3C coincided with a [M + H] + signal of m/z 
444, which is consistent with N3-alkylation (Figure 2C). 
From the tandem mass spectrum in Figure 4, the m/z 444 
— > m/z 284 mass transition was identified and used to 
develop an LC/MS/MS method for the sensitive detection 
of uridine and its adduct UN3C (see Supplementary Data 
for details). Using this method we were able to detect 
UN3C in various N3BC-treated RNA species, including 
polyU, binary oligomers, ternary oligomers and an 
in vitro transcript of tRNA phe from S. cerevisiae (see 
Supplementary Figures S6 and S7). Figure 4 shows the 
LC/MS analysis of an N3BC-treated ternary oligonucleo- 
tide (CUGGUCGCUUGCUUC), displaying a clearly 
visible UN3C peak at ?=11.75min. Screening this 
RNA for the expected mass of an adduct of N3BC with 
guanosine, i.e. an [M + H] + signal of m/z 483, revealed 
three faint signals at retention times 11.00, 11.12 and 
11.55min. Inspection of the tandem mass spectrum sug- 
gested fragmentation behavior similar to the UN3C 
compound. The origin of three separate peaks may be 
related to alkylation by N3BC of different nitrogens in 
guanosine, although this hypothesis cannot be confirmed 
by LC/MS/MS because fragmentation occurs at or near 
the glycosidic bond. Assuming a similar response factor 
for UN3C and the guanosine adducts of N3BC, and 
taking into account the relative abundance of G and U 
residues in this oligomer, a conservative estimate of the 
relative abundance of both compounds (areas under the 
curves) amounts to a ratio of ~1:25, which is consistent 
with fluorescence intensities observed in Figure 3. 

Influence of RNA structure on reactivity towards N3BC 

To evaluate the influence on UN3C formation of potential 
blocking of uridine-N3 by base pairing in the helical RNA 
structure, 100 pmol of each of two N3BC-treated tripartite 
CUG and GAC oligomers were imaged after PAGE. 
These conditions were deliberately chosen such that 
coumarin fluorescence resulting from guanosine adducts 
is not detectable (400 and 700 pmol of G residues per 
band, respectively). Accordingly, only the blue fluores- 
cence of the oligomer containing uridine residues is 
visible (Figure 5, lanes 4 and 6). These oligomers, which 
are complementary with only a small 5'-overhang of the 
GAC oligomer, were treated with N3BC separately and as 
an equimolar mixture. Under both conditions, the CUG 
oligomer in the corresponding lanes 4 and 6 in Figure 5 
shows identical blue coumarin fluorescence, which can be 
distinguished by the naked eye. Since the presence of the 
complementary GAC oligomer does not affect the 



Figure 2. Continued 

unpolar that it elutes in the later part of the gradient which is not displayed. (C) Reaction of N3BC with uridine to UN3C. (D) UV-spectra of N3BC, 
uridine and UN3C. Arrows indicate wavelengths used to monitor aromatic rings (254 nm, upper panels in A and B) and coumarins (320 nm, lower 
panels in A and B). (E) Fluorescence kinetic of a solution of UN3C upon UV-irradiation at 365 nm. Full emission spectra were recorded at selected 
time points. (F) Time course of the fluorescence intensity at 450 nm plotted from the data in (E). 
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Figure 4. LC/MS/MS analysis of reaction products of N3BC with uridine and guanosine residues in RNA. (A) Mass spectrum, structure and main 
fragmentation of positively charged [M + H] + of conjugates of N3BC to uridine (upper panel) and guanosine (lower panel). Mass transitions used in 
(B) are indicated by arrows. The structure of the guanosine product is one out of three possible reaction products of nitrogen alkylation. (B) LC/MS/ 
MS transition trace for guanosine adducts (lower trace) and the uridine adduct (middle trace). The total ion count (TIC) in the upper trace is the 
weighed sum of the lower two traces. Note the disparate relative intensities in the TIC trace: the smaller peak corresponds to guanosine adducts and 
resolves into three species seen in the lower trace. 



reaction of N3BC with the CUG oligomer, it is concluded 
that the denaturing effect of 70% DMSO content is suffi- 
cient to render all uridines accessible to alkylation. 
Interestingly, lowering the DMSO content to 40% 
protects uridines against alkylation by N3BC in the 
same experimental setting (Supplementary Figure S8), 
suggesting that elements of secondary structure are 
forming in 40% DMSO. 

To further elucidate, if strongly structured elements in 
RNA prevents its reaction with N3BC in 70% DMSO, we 
have investigated yeast tRNA phe , generated by in vitro 
transcription. This tRNA is one of the working horses 
of RNA structural biology, and a paradigm of a 
strongly structured RNA, whose intricate 3D structure is 
well understood from crystallography (53). In order to 
increase the sensitivity of coumarin azide detection in 
N3BC-treated RNA, it was decided to exploit the 



inherent potential of the aromatic azide for click reactions. 
Recently, we demonstrated the application of click- 
reaction to alkyne-derivatized tRNAs and optimized reac- 
tion conditions in order to avoid RNA degradation (21). 
Thus, N3BC-treated in vitro transcript of tRNA phe , 
estimated by LC/MS analysis to contain ~10% of 
N3BC-modified uridines, was incubated with a fluorescein 
terminal alkyne derivative under Huisgen-type cycloadd- 
ition conditions. These include, in particular, the presence 
of copper-(I) ions, which are typically generated in situ by 
reduction of Cu-(II) with ascorbate. Copper-(I) ions were 
stabilized by addition of the chelating ligand THPTA. To 
analyze derivatization by Huisgen-type cycloaddition, the 
RNA was subjected to urea-PAGE and the gel was 
scanned for fluorescence. The left panel in Figure 6 
shows a fluorescence emission scan upon fluorescein exci- 
tation at 532nm. The scan reveals a distinct fluorescent 
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band co-migrating with the tRNA. The appearance of this 
click-adduct is dependent on the presence of Cu(I) in the 
reaction mixture, product formation is proportional to 
the amount of N3BC-modified RNA in the reaction, 
and the reaction does not occur without prior treatment 
of the RNA with N3BC. Unreacted dye can be detected 
migrating in the gel. The SYBR Gold stain in the lower 
panel clearly shows that RNA is stable under click condi- 
tions. Identical results were obtained with a terminal 
alkyne derivative of the AlexaFluor 647 dye, except that 
the unreacted alkyne-dye co-migrates with tRNA and was 



3'-GACCAGCGAACGAAGCAGG-5' 
5'-CUGGUCGCUUGCUUC-3' 
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Figure 5. Influence of RNA secondary structure on UN3C formation 
investigated by in-gel detection of RNA-coumarin conjugates. The two 
ternary GAC and CUG oligomers shown in the upper part are com- 
plementary save for a 5'-overhang on the GAC oligomer. The oligo- 
mers were reacted with N3BC separately (lanes 5 and 6), and as an 
equimolar mixture (lane 4). Lanes 1-3 show the corresponding mock 
incubation controls. The samples were run on a denaturing PAGE and 
stained with GelRed. Upon excitation at 365 nm, the fluorescence was 
imaged with a conventional digital camera. The blue coumarin fluores- 
cence can be clearly distinguished from the red fluorescence of the 
GelRed stain. Note that coumarin conjugates migrate more slowly 
than unreacted RNA. 



therefore removed by spin column gel filtration prior to 
PAGE analysis (Figure 6, right panel). We conclude that 
serial treatment of RNA with N3BC and a fluorescent 
terminal alkyne under click conditions constitutes a new 
way for post-synthetic fluorescent labeling of RNA that 
takes place selectively on the N3 of uridines. 

To trace the distribution of coumarin azide residues 
throughout the tRNA sequence, material obtained from 
in vitro transcription in the presence of [a- 32 P]-GTP was 
submitted to complete digestion with RNase Tl. This 
combination results in the generation of short RNA frag- 
ments containing one 32 P-labeled guanosine residue on 
their respective 3'-ends. Separation of such a 'G-catalog' 
on a denaturing 20% PAGE resolves most of the frag- 
ments of 1-8 nt in size (See Supplementary Figure S9). 
When a G-catalog of N3BC-treated tRNA was click- 
labeled with Alexa 647 azide, fluorescence signals were 
detected for at least seven fragments. Mapping of the 
G-catalog onto the fluorescence signals (See 
Supplementary Figure S6 for details of the mapping) 
reveals that, although the technique does not allow 
complete coverage of all fragments, coumarin residues 
are clearly not confined to any particular region of the 
tRNA. Clearly, a content of 70% DMSO is sufficient to 
denature even highly structured RNAs such as this tRNA, 
and to render all uridines equally accessible to alkylation 
by N3BC. Application of the 'click-detection' developed 
above also allowed us to trace low amounts of N3BC- 
conjugates obtained with RNA oligomers at DMSO 
concentrations as low as 20% (see Supplementary Figure 
S10). Although a decrease in DMSO content dramatically 
decreases alkylation yield, these data suggest, that for a given 
RNA, an optimization of the DMSO content should allow 
to choose between alkylation under denaturing or under 
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near-native conditions. Consequently, the resulting RNAs 
would bear UN3C equally distributed over the RNA 
sequence, or preferentially at those positions, whose 
uridine-iV5 is not involved in features of the native 
structure. 

Application of N3BC-treated RNA in crosslinking studies 

During initial investigations of the light sensitivity of 
N3BC derivatives isolated from nucleoside reactions, a 
variety of fluorescent peaks were observed in LC analysis 
of irradiated material (Supplementary Figure S2), presum- 
ably arising from various reaction pathways undergone by 
a coumarin-nitrene intermediate, which was generated by 
photochemical elimination of N 2 from UN3C (35). A peak 
at 18.2 min displays high coumarin fluorescence and a mo- 
lecular mass of m/z 418 (See Supplementary Figure S2), 
which is consistent with an aromatic amine, a known re- 
duction product of nitrenes (35). Nitrenes are an electron 
deficient species known to undergo a variety of insertion 
reactions, which leads to an important application in 
crosslinking studies (23,33,35,36). Because the observed 
photoreactivity suggested the potential application of 
UN3C-containing RNA in crosslinking reactions to cog- 
nate proteins, this possibility was exploited in detail. 
5'- 32 P-labeled aliquots of the above N3BC-treated 
tRNA phe were irradiated with UV-light (365 nm) in the 
presence of proteins known to bind this particular 
tRNA. The SDS-PAGE in the Figure 7A clearly shows 
efficient formation of intermolecular crosslink bands with 
a variety of cognate tRNA modification enzymes, 
including methyltransferases and pseudouridine synthases 
known to specifically recognize features of tRNA archi- 
tecture (42-45). No crosslink was observed with untreated 
tRNA phe (Figure 7A, right panel) and with control 
proteins known to not bind RNA (see Supplementary 
Figure S7), confirming the specificity of the reaction. 
Analysis by urea-PAGE shows that the N3BC-treated 
tRNA also forms inter- or intramolecular RNA-RNA 
crosslinks in the absence of protein (see Supplementary 
Figure SI 2). 

In order to extend the N3BC application to RNAs 
differing from the canonical tRNA scaffold, we included 
several further RNA-protein interaction pairs in our 
study. One pair consisted in phage MS2 RNA 
(Figure 7B) and its partner MS2 protein (47,54) used as 
a fusion protein with Maltose Binding Protein 
(MS2-MBP). As a second model, we used two fragments 
of HIV-1 virus RNA containing the A7 splice site and its 
specific regulator, hnRNP Al protein (49,55) (Figure 7C). 
The first fragment (SLS2 RNA) of ~80nt contained an 
HIV-1 RNA region spanning nt 7965-8046 and exhibits 
an extended stem-loop structure. The second HIV-1 
derived transcript (SLS123 RNA) contained the SLS2 
region along with two additional stem-loop structures at 
5'- and 3'-extremities (nt 7903-8092 in HIV-1 RNA). Both 
fragments had previously been shown to bind hnRNP Al 
protein with high affinity and the binding sites for hnRNP 
Al had been determined by footprinting studies (39,49). 

5'- 32 P-labeled in vitro transcripts of these model RNAs 
were N3BC-treated, and, after UV-crosslinking at 365 nm, 



the resulting RNA-protein covalent adducts were 
analyzed by SDS-PAGE. As shown in Figure 7 (Panels 
B and C), all three studied RNAs were successfully 
crosslinked with their corresponding protein partners, 
demonstrating that N3BC-treated RNA transcripts may 
be used for such studies. Similar data were also obtained 
for one other binding partner of SLS2 and SLS123 
RNAs, namely hnRNP K protein (see Supplementary 
Figure SI 3). 

In order to assess the specificity of X-link using 
N3BC-treated RNA, we also performed experiments 
with the paL7ae protein, which specifically binds K-turn 
containing RNAs (like sRNA guides). This protein shows 
a faint but specific crosslink signal with Pab21 RNA, 
which contains such a structural motif. On the contrary, 
RNAs lacking a K-turn, such as tRNA phe or the 22mer 
fragment of P. abyssi rRNA, did not crosslink to L7ae 
upon irradiation at UV365 nm (see Supplementary 
Figure S14). The above findings clearly identify 
crosslinking reactions as a useful feature of the 
coumarin azide in various types of RNA. 



DISCUSSION 

Alkylation and acylation reactions of nucleobases have a 
long standing scientific history that includes the develop- 
ment of RNA sequencing techniques (56) and of structural 
probing [(57); for reviews see (37,58)]. Classical electro- 
philes such as dimethylsulfate are monofunctional and 
rather unspecific with respect to the targeted nucleophile. 
While modern electrophilic reagents are more selective, 
e.g. for ribose 2'-OH (59) or the N2 of guanosine (60), 
the specificity of N3BC for uridine is a unique feature 
among electrophiles; CMCT, the only commonly 
employed structural probe for uridines, also reacts with 
guanosine to an extend that allows its application in struc- 
tural probing of guanosine residues, while N3BC shows 
only trace reactivity with guanosine (37). 

The bidentate chloroacetaldehyde (61) is still one of the 
few electrophilic reagents, which contain or form chromo- 
phores of analytical benefit. More recently developed 
non-alkylating structural probes make use of light 
induced scission (62,63). Certain alkylating derivatives of 
aromatic azides have been used for crosslinking purposes 
(64). On the other hand, modern bioconjugate chemistry 
features the development of reagents with multiple 
functionalities in application such as activity-based 
protein profiling which may include functionalities for 
target binding, crosslinking, spectroscopic detection, 
affinity purification and others [e.g. (65), for reviews see 
(66,67)]. 

The N3BC reagent developed in this study combines 
alkylating reactivity with analytically useful spectroscopic 
properties and multifunctionality; its coumarin chromo- 
phore can be detected in UV absorption, and, after pho- 
tolysis of the azido function, by fluorescence. We have 
found the properties of N3BC and its RNA conjugates 
very convenient in investigations of RNA-protein inter- 
actions. RNA can be treated in large quantities, and the 
resulting RNA conjugates are stable over several months 
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Figure 7. Use of N3BC-treated RNA in RNA-protein crosslinking studies. (A) N3BC-treated tRNA phe crosslinks with RNA binding proteins upon 
irradiation. The secondary tRNA structure is shown on the left. 5'- 32 P-labeled N3BC-treated tRNA phe or similar amounts of control tRNA phe 
transcript were irradiated (as indicated above the gel image) for Omin or 30 min at 365 nm. RNA-protein crosslinked products were analyzed by 
SDS-PAGE shown in the middle panel. The following RNA binding proteins were used: P. furiosus Trml (pfTrml), P. abyssi TrmI (paTrml), 
S. cerevisiae Trm4 (scTrm4) and T. maritime! TruB (tmTruB). Gels were Coomassie Blue stained to verify similar loading of different proteins (shown 
at the bottom). The lane labeled 'C shows a tRNA sample irradiated in the absence of protein. The negative control in the right panel shows an 
SDS-PAGE of samples which were not treated with N3BC prior to crosslinking. A control experiment with proteins known not to bind to RNA is 
shown in Supplementary Figure SI 1 . (B) Secondary structure of MS2 RNA (left) and crosslinking of N3BC-treated MS2 RNA with the recombinant 
MS2-MBP fusion protein (right). Incubations, crosslinking and analysis were performed as described above for tRNA phe . (C) Secondary structure of 
HIV-1 derived SLS2 and SLS123 RNAs (left). The region corresponding to SLS2 RNA is shaded. Numbering corresponds to nucleotide numbering 
in the HIV-1 genome. Crosslinking of SLS2 and SLS123 RNAs to the recombinant hnRNP Al protein is shown in the right panel. Incubations, 
crosslinking and analysis were performed as described above for tRNA phe . 



at — 20°C, when thoroughly protected from light. Aliquots 
may be removed to verify the success of the modification 
reaction, for labeling or for crosslinking purposes. 

Previous azide-based RNA crosslinking methods 
required the presence of sulfur as reactive moiety for the 
introduction of the photoactive compound (23). Here, 
N3BC offers a much wider range of applications, 
because it can be used on native RNA, synthetic RNA, 
and in vitro transcripts alike. The crosslinking is con- 
ducted at 365 nm, a wavelength that does not damage 
the nucleobases. The RNA used here carried UN3C 
residues randomly distributed throughout the entire 
sequence, because the RNA structure was denatured in 
70% DMSO during the reaction with N3BC. However, 



we have also shown, that elements of secondary structure 
remain intact at 40% DMSO content, and that UN3C can 
be detected even in reactions conducted in 20% DMSO. 
Hence, future applications of N3BC may possibly be de- 
veloped, which contain crosslinking UNC3 moieties only 
at positions that are accessible in the native RNA 
structure. 

In addition to being a crosslinking moiety, the aromatic 
azide doubles as a conjugation module, onto which almost 
any desired functionality can be attached by click chemis- 
try. Although here exemplified with a fluorescent dye, 
click conjugation to alkyne derivatives conceivably 
allows attachment of other functional groups such as 
biotin. Advantages over other types of chemistry offered 
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by azide-alkyne click derivatization include the low con- 
centration of azide compounds and mild reaction condi- 
tions. While numerous examples of DNA derivatization 
by click chemistry have recently been published (27,68,69), 
its application in the RNA field is only about to become 
popular (18,21,30-32). We find that RNA is stable under 
in vitro click conditions and therefore we anticipate a 
widespread future use of the N3BC reagent. 
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